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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



International Application No.: PCT/EP97/05441 

International Filing Date: October 2, 1997 

For: RECOMBINANT PROCESS FOR 

PREPARING A COMPLETE MALARIA 
ANTIGEN, gp!90/MSPl 



Assistant Commissioner for Patents 
Washington, D.C. 20231 

Dear Sir: 

Prior to examination of the above-identified application, please cancel claims 1-41 (all 
original claims), without prejudice or disclaimer, and add the new claims 42-82, shown below. 




In the application of: 



Examiner: Unassigned 
Group Art Unit: Unassigned 



Hermann BUJARD et al. 



PRELIMINARY AMENDMENT 



pa-364638 



I. AMENDMENT 



In the specification 

At page 1, after the title, please insert the following sentence. 

- This application is a-35 U.S.C. §371 filing of International Patent Application No. 
PCT/EP97/05441, filed October 2, 1997. This application claims priority benefit of German 
Patent Application No. DE 19640817.2, filed October 2, 1996. 

In the Claims : 

Please cancel claims 1-41, without prejudice or disclaimer. Please add new claims 42-82, 
shown below. 

~ 42. A method for producing a complete gpl90/MSPl protein of a Plasmodium, comprising 
expressing a nucleotide sequence encoding the complete gpl90/MSPl protein in an 
expression vector, wherein the AT content of the expressed nucleotide sequence encoding 
the complete gpl90/MSPl protein is less than the AT content of a naturally occurring 
nucleotide sequence encoding a gpl90/MSPl protein. 

43. The method of claim 42, wherein the Plasmodium is a strain of Plasmodium falciparum. 

44. The method of claim 43, wherein the strain of Plasmodium falciparum is P. falciparum 
strain FCB-1. 

45. The method of claim 42, wherein the AT content is reduced from about 74% to 55%. 

46. The method of claim 42, wherein the nucleotide sequence encoding the complete 
gpl90/MSPl is denoted gpl90s in Figure 3C. 

47. The method of claim 42, wherein the nucleotide sequence encoding a gpl90/MSPl 
protein further comprises a nucleotide sequence encoding an attachment signal. 
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48. The method of claim 42, wherein the nucleotide sequence encoding a gpl90/MSPl 
protein further comprises a nucleotide sequence encoding a signal peptide. 

49. The method of claim 42, wherein nucleotide sequence encodes a gpl90/MSPl having the 
amino acid sequence selected from the group consisting of amino acids 1-1639, 1-1621, 
and 20-1621, depicted in Figure 3C. 

50. The method of claim 1 , wherein the nucleotide sequence encoding the complete 
gpl90/MSPl protein is produced according to a method comprising: 

(a) designing a nucleotide sequence encoding a gpl90/MSPl from a Plasmodium to 
contain codon frequencies common in the human genome, while maintaining the encoded 
gpl90/MSPl protein; 

(b) dividing the sequence of step (a) into individual overlapping regions; 

(c) synthesizing desoxynucleotides each extending the whole length of each individual 
region of step (b); 

(d) cloning the individual region coding sequences; and 

(e) fusing the regions to produce a sequence encoding gpl90/MSPl. 

51. The method of claim 50, wherein the nucleotide sequence being designed encodes a 
gpl90/MSPl protein from Plasmodium falciparum FCB-1, and the overlapping regions 
of step (b) encode p83, p31, p36, p30, and pi 9, and wherein the method further 
comprises a step (c') which is performed after step (c) and comprises synthesizing coding 
regions encoding gpl9, gp30, p36 and p31 by polymerase chain reaction and synthesizing 
a coding region encoding p83 by fusion of two sequences comprising approximately 
1200 bp. 
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52. The method of claim 50, wherein the desoxyoligonucleotides synthesized in step (c) are 
on average 120 nucleotides long, and wherein the overlap between individual regions is 
about 20 bases. 

53. The method of claim 42, wherein the expression vector is selected from the group 
consisting of dPS56RBSII 5 pBi-5, and ppTMCS. 

54. The method of claim 42, wherein the nucleotide sequence is expressed in an Escherichia 
coli (E. colt) strain. 

55. The method of claim 54, wherein the E. coli strain is DHSalphaZl . 

56. The method of claim 42, wherein the nucleotide sequence is expressed in an expression 
system selected from the group consisting of HeLa cells and CHO cells. 

57. The method of claim 42, wherein the nucleotide sequence is expressed in an expression 
system selected from the group consisting of Toxoplasma gondii and Leishmania. 

58. A nucleotide sequence encoding a complete gpl90/MSPl protein of a Plasmodium, 
wherein the AT content of the nucleotide sequence encoding the complete gpl90/MSPl 
protein is less than the AT content of a naturally occurring nucleotide sequence encoding 
a gpl90/MSPl protein. 

59. The nucleotide sequence of claim 58, further comprising a nucleotide sequence encoding 
an attachment signal. 

60. The nucleotide sequence of claim 58, further comprising nucleotide sequences encoding a 
signal peptide. 

61 . The nucleotide sequence of claim 58, wherein the sequence further comprises a sequence 
encoding an N-terminal extension of about 1 1 amino acids, of which six are histidine. 

62. The nucleotide sequence of claim 58, wherein the sequence lacks sequences that function 
as splice donor and splice acceptor signals. 
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63 . The nucleotide sequence of claim 58, wherein the sequence lacks a region comprising 
GC-rich sequences. 

64. The nucleotide sequence of claim 58, wherein the sequence lacks recognition sequences 
for restriction enzymes which recognize sequences of six or more base pairs. 

65. The nucleotide sequence of claim 58, wherein the nucleotide sequence comprises one or 
more cleavage sites for restriction endonucleases, and wherein each of said cleavage sites 
occurs no more than once in said nucleotide sequence. 

66. The nucleotide sequence of claim 58, further comprising cleavage sites for endonucleases 
at the 5' and 3' ends of the nucleotide sequence, said endonuclease cleavage sites being 
absent from the remainder of the nucleotide sequence and from an expression or cloning 
vector containing said nucleotide sequence. 

67. The nucleotide sequence of claim 58, wherein the nucleotide sequence encoding the 
complete gpl90/MSPl consists of a sequence denoted gpl90s in Figure 3C. 

68. The nucleotide sequence of claim 58, wherein the AT content is reduced from about 74% 
to about 55%, compared to the AT content of the naturally occurring nucleotide 
sequence. 

69. The nucleotide sequence of claim 58, wherein nucleotide sequence encodes a 
gpl90/MSPl having the amino acid sequence selected from the group consisting of 
amino acids 1-1639, 1-1621, and 20-1621 of the sequence denoted gpl90s in Figure 3C. 

70. A vector comprising the nucleotide sequence of claim 58. 

71 . The vector of claim 70, wherein the vector is an expression vector. 

72. The vector of claim 7 1 , wherein the expression vector is selected from the group 
consisting of dPS56RBSII, pBi-5, and ppTMCS. 

73. A host organism comprising the nucleotide sequence of claim 58. 
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74. The host organism of claim 73, wherein the host organism is an Escherichi coli (E. coli) 
strain. 

75. The host organism of claim 74, wherein the E. coli strain is DHSalphaZl . 

76. The host organism of claim 73, wherein the host organism is a cell selected from the 
group consisting of a HeLa cell and a CHO cell. 

77. The host organism of claim 76, wherein the cell constitutively synthesizes a tetracycline 
transactivator. 

78. The host organism of claim 73, which is selected from the group consisting of 
Toxoplasma gondii, Leishmania, baculovirus, adenovirus, and yeast. 

79. A method of delaying development of malaria parasite Plasmodium parasitemia in an 
animal, comprising administering gpl90/MSPl to the animal. 

80. A composition for use as a vaccine, said composition comprising a nucleotide sequence 
according to claim 58. 

81. A composition comprising a host organism according to claim 73 . 

82. A method for stabilizing a gene sequence, comprising reducing the AT content of the 
sequence, compared to the AT content of the naturally occurring gene sequence. 

II. REMARKS 

The PCT application contained claims 1-41. By this Preliminary Amendment, 
Applicants have cancelled original claims 1-41, without prejudice or disclaimer, and have added 
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new claims 42-82. Claims 42-82 are now pending in this application. Support for the new 
claims is found in the original claims and throughout the specification as filed. 



Prior to examination, please enter the amendments submitted herein. 

If a telephone call would further prosecution of this case, the Examiner is invited to call 
the undersigned at (650) 813-5776. In the unlikely event that the transmittal letter is separated 
from this document and the Patent Office determines that an extension and/or other relief is 
required, applicant petitions for any required relief including extensions of time and authorizes 
the Assistant Commissioner to charge the cost of such petitions and/or other fees due in 
connection with the filing of this document to Deposit Account No. 03-1952 referencing docket 
no. 4021620Q0200 . However, the Assistant Commissioner is not authorized to charge the cost of 
the issue fee to the Deposit Account. 



IH. SUMMARY 



Respectfully submitted, 



Dated: 



April 1, 1999 




Paula A. Borden ^ 
Registration No. 42,344 
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Telephone: (650) 813-5776 

Facsimile: (650) 494-0792 
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□ Nonprofit Organization 
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having rights to the invention averring to their status as small entities. (37 C.F.R. § 1 .27) 

I acknowledge the duty to file, in this application or patent, notification of any change in status 
resulting in loss of entitlement to small entity status prior to paying, or at the time of paying, the earliest of 
the issue fee or any maintenance fee due after the date on which status as a small entity is no longer 
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statements may jeopardize the validity of the application, any patent issuing thereon, or any patent to which 
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I have not assigned, granted, conveyed or licensed and am under no obligation under contract or law 
to assign, grant, convey or license, any rights in the invention to any person who would not qualify as an 
independent inventor under 37 CJF.R. § 1 .9(c) if that person had made the invention, or to any concern 
which would not qualify as a small business concern under 37 C.F.R. § 1 .9(d) or a nonprofit organization 
under 37 C.F.R. § 1.9(e). 

Each person, concern or organization to which I have assigned, granted, conveyed, or licensed or am 
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Germany 


S Individual ^ " 

□ Small Business Concern 

□ Nonprofit Organization 



NOTE: Separate verified statements are required from each named person, concern or organization 
having rights to the invention averring to their status as small entities. (37 C.F.R. § 1 .27) 

I acknowledge the duty to file, in this application or patent, notification of any change in status 
resulting in loss of entitlement to small entity status prior to paying, or at the time of paying, the earliest of 
the issue fee or any maintenance fee due after the date on which status as a small entity is no longer 
appropriate. (37 C.F.R. § 1.28(b)) 
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imprisonment, or both, under section 1001 of Title 18 of the United States Code, and that such willful false 
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Recombinant process for preparing a complete malaria antigen, gp190/MSP1 

The invention concerns a recombinant manufacturing process for the complete malaria 
antigen gp190/MSP1, as well as separate naturally-occurring domains and parts of the same, 
by expression of a synthetic DNA sequence. The invention concerns in addition the DNA 
sequences produced by the process and the host organisms used for the expression of the 
DNA sequences. In addition the invention concerns the use of the complete malaria antigen 
as well as parts thereof as a vaccine for immunization against malaria. 

Finally the invention under consideration concerns a stabilization process 

for AT-rich genes, as well as stabilized genes which are characterized by a reduced AT 

content. 

Malaria is one of the most significant infectious diseases in the world. According to WHO 
reports, in 1 990 40% of the world population in 99 countries was exposed to the risk of 
malaria. At the same time its distribution is enormously on the increase. This may be ascribed 
above al! to intensive development of resistance in the parasites causing malaria, promoted 
by the recommendation and use as prophylactics of the drugs intended for treatment. Besides 
the search for new and effective chemotherapeutic agents hope is nowadays directed towards 
the development of vaccines, since people in areas of the world where malaria is epidemic do 
manage to develop some kinds of immunity. As well as a natural resistance to malaria, such 
as that found in heterozygous carriers of the sickle-cell gene and people with thalassaemia 
and glucose-6-phosphate dehydrogenase deficiency, in the course of malarial infection in 
humans immune mechanisms can be stimulated which express themselves in a heightened 
capacity for resistance to the Plasmodia. Consequently the course of the disease in 
populations exposed to severe epidemics is generally less threatening than in persons 
exposed to the infection less frequently or for the first time. 

The main problem in the development of a vaccine is the identification of an antigen which can 
induce protective immunity, since there is no easily accessible well-defined animal model 
available for the four parasites affecting man. The organism causing malaria belongs to the 
Plasmodium group, of which infection with one of the four parasites Plasmodium vivax, 
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Plasmodium ovale, Plasmodium malariae or Plasmodium falciparum results from the bite of 
Anopheles mosquitoes. Of these parasites Plasmodium falciparum is the most dangerous and 
the most widely distributed. 

The main surface protein of the merozoite, the invasive form of the blood stage of the malaria 
parasite Plasmodium falciparum and other malaria parasites such as P. vivax, is a 190-220 
kD glycoprotein. Late in the development of the parasite this precursor is processed into 
smaller proteins, which can however be isolated from merozoites as a unitary complex. By 
means of a glycosylphosphatidyl-inositol bond this complex is coupled to the merozoite 
membrane. The sequences of the gp190 proteins of various P. falciparum strains fall into two 
groups, between which intragenic recombination is frequent. In general the protein consists of 
many highly conserved regions, of a dimorphic zone to which in each case one of two alleles 
belongs, and of two relatively small oligomorphic blocs in the N-terminal zone (Tanabe, K., 
Mackay, M., Goman, M. and Scaife, J.G. (1987), Allelic dimorphism in a surface antigen gene 
of the malaria parasite Plasmodium falciparum. J. Mol. Biol. 195, 273-287, Miller, L.H., 
Roberts, T., Shahabuddin, M. and McCutchan, T.F. (1993), Analysis of sequence diversity in 
the Plasmodium falciparum merozoite surface protein-1 (MSP-1). Mol. Biochem. Parasitol. 59, 
1-14). 

Already early on gp190/MSP1 was considered as a possible candidate for a vaccine. In the 
rodent model active protection against infection with rodent parasites was obtained following 
immunization with the analogous protein. Passive protection could be procured with antibodies 
directed against this protein (see also Holder, A.A. and Freeman, R.R. (1981), Immunization 
against blood-stage rodent malaria using purified parasite antigens, Nature 294, 361-364; 
Marjarian, W.R., Daly, T.M., Weidanz, W.P. and Long, C.A. (1984), Passive immunization 
against murine malaria with an lgG3 monoclonal antibody, J. Immunol. 132, 3131-3137). The 
data which ought to support this assumption are nevertheless in details not statistically 
significant. 

There are in addition, a number of monoclonal antibodies which in vitro inhibit the invasion of 
erythrocytes by P. falciparum and are directed against gp190/MSP1 (Pirson, P.J. and Perkins, 
M.E. (1985), Characterization with monoclonal antibodies of a surface antigen of Plasmodium 



falciparum merozoites. J. Immunol. 134, 1946-1951; Blackman, M.J., Heidrich, H.-G., 
Donachie, S., McBride, J.S. and Holder, A.A. (1990), A single fragment of a malaria merozoite 
surface protein remains on the parasite during red-cell invasion and is the target of invasion- 
inhibiting antibodies. J. Exp. Med. 172, 379-382). 

Finally, a series of vaccine studies have been carried out with gp190/MSP1 from P. falciparum 
on primates, particularly on Aotus and Saimiri monkeys (see also Perrin, L.H., Merkli, B., 
Loche, M., Chizzolini, C, Smart, J. and Richie, R. (1984), Antimalarial immunity in Saimiri 
monkeys. Immunization with surface components of asexual blood stages, J. Exp. Med. 160, 
441-451; Hall, R., Hyde, J.E., Goman, M., Simmons, D.L., Hope, I.A., Mackay, M. and Scaife, 
J.G. (1984), Major surface antigen gene of a human malaria parasite cloned and expressed in 
bacteria, Nature 311, 379-382; Siddiqui, W.A., Tarn, L.Q., Kramer, K.J., Hui, G.S.N., Case, 
S.E., Yamaga, K.M., Chang, S.P., Chan, E.B.T. and Kan, S.-C. (1987), Merozoite surface coat 
precursor protein completely protects Aotus monkeys against Plasmodium falciparum malaria, 
Proc. Natl. Acad. Sci. USA 84, 3014-3018; Ettlinger, H.M., Caspers, P., Materile, H., 
Schoenfeld H.-J., Stueber, D. and Takacs, B. (1991), Ability of recombinant or native proteins 
to protect monkeys against heterologous challenge with Plasmodium falciparum, Inf. Imm. 59, 
3498-3503; Holder, A.A., Freeman, R.R. and Nicholls, S.C. (1988), Immunization against 
Plasmodium falciparum with recombinant polypeptides produced in Escherichia coli, Parasite 
Immunol. 10, 607-617; Herrera, S., Herrera, M.A., Perlaza, B.L., Burki, Y., Caspers, P., 
Doebeli, H., Rotmann D. and Certa, U. (1990), Immunization of Aotus monkeys with 
Plasmodium falciparum blood-stage recombinant proteins, Proc. Natl, Acad. Sci. USA 87, 
4017-4021; Herrera, M.A., Rosero, F., Herrera, S., Caspers, P., Rotmann, D., Sinigaglia, F. 
and Certa, U. (1992), Protection against malaria in Aotus monkeys immunized with a 
recombinant blood-stage antigen fused to a universal T-cell epitope; correlation of serum 
gamma interferon levels with protection, Inf. Imm. 60, 154-158; Patarroyo, M.E., Romero, P., 
Torres, M.L., Clavijo, P., Moreno, A., Martinez A., Rodriquez, R., Guzmann, F. and Cabezas, 
E. (1987), Induction of protective immunity against experimental infection with malaria using 
synthetic peptides, Nature 328, 629-632). In these vaccine studies two premises may be 
distinguished: 

- Use of material isolated from parasites, and 
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r - Administration of material procured in heterologous systems of expression. 

The latter consists as a rule of relatively small segments of the total protein. Although the 
results of the inoculations carried out preliminarily on monkeys indicate that gp190/MSP1 
could bring about protection, all the experiments carried out on primates have two problems, 
which place such a conclusion in question: 

(a) they were carried out on too small groups of animals 

(b) they were not repeated. 

The results and the conclusions drawn from them are consequently not statistically confirmed. 
Besides the difficulty of access to suitable monkeys there remains the main basic problem, 
that it has so far not been possible to manufacture good vaccination material in a suitable 
quantity. 

On the other hand, after the sequencing of the gp190 gene from the K1 and MAD20 strains of 
Plasmodium falciparum overlapping fragments could be expressed in E. coii. With this 
material epidemiological studies in West Africa showed that in the adolescent group a 
correlation existed between antibody titre against gp190/MSP1 fragments on one hand and 
protection from parasite infection on the other. In addition the titre also appeared to correlate 
with the capacity to control the parasitaemia even at a low level (Tolle et al. (1993): A 
prospective study of the association between the human humoral immune response to 
Plasmodium falciparum blood stage antigen gp190 and control of malarial infections, Infect 
Immun. 61, 40-47). These results are supplemented by new investigations on Aotus monkeys 
in the framework of the present invention. Here an enhanced protection against infection with 
the parasite was attained because protein preparations from Plasmodium falciparum, which 
consisted predominantly of unprocessed gp190/MSP1, had been used as vaccine. The 
monkeys with the highest antibody titres against gp190/MSP1 were the best protected. These 
results eventually indicated gp190 as a most promising candidate for a vaccine against 
tropical malaria. 
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By some groups of workers the C-terminal domain of gp190 (p19 or p42) is assigned a 
particular role in the immunity mediated by gp190 (see also Chang, S.P., Case, S.E., Gosnell, 
W.L, Hashimoto, A., Kramer, K.J., Tarn, LQ., Hashiro, C.Q., Nikaido, CM., Gibson, H.L, Lee- 
Ng, C.T., Barr, P.J., Yokota, B.T. and Hui, G.S.N. (1996), A recombinant bacuiovirus 42- 
kilodalton C-terminal fragment of Plasmodium falciparum merozoite surface protein 1 protects 
Aotus monkeys against malaria, Inf. Imm. 64, 253-261; Burghaus, P.A., Wellde, B.T., Hall, T., 
Richards, R.L, Egan, A.F., Riley, E.M., Ripley-Bailou, W. and Holder A.A. (1996), 
Immunization of Aotus nancymai with recombinant C-terminus of Plasmodium falciparum 
merozoite surface protein 1 in liposomes and alum adjuvant does not induce protection 
against a challenge infection, Inf. Imm., in press. 

Thus far, however, it has also been impossible to exclude other parts of gp190 on a rational 
basis as irrelevant to a protective immune response. Hence it is as necessary as ever to use 
the entire gene or the intact gp190 for vaccine investigations. Despite multiple investigations 
by various work-groups, however, there has not yet been any success in cloning and 
expressing the entire gp190/MSP1 gene. 

Nor has it so far been possible to exclude a priori any part of the gp1 90 sequence as 
irrelevant to the protective immune response, so that it is as necessary as ever to use the 
entire gene or gene product for vaccine investigations. Nevertheless, despite many 
investigations by a number of working groups there has not yet been any successful cloning of 
the whole gene for gp1 90/MSP1 . 

One object of the present invention has consequently been to make available an adequate 
quantity of vaccine material in the form of the complete gp190/MSP1 . It was a further object of 
the present invention to provide a process by which this vaccine material could be recovered. 

In addition it was another object on the part of the present invention to provide a complete 
DNA sequence of gp190/MSP1 which could be expressed in a host organism. 

Yet another object of the present invention was to provide host organisms containing the 
complete gp190/MSP1 gene. 
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Finally, it was also an object of the present invention to provide a stabilization process for AT- 
rich genes, as well as a stabilized gene suitable for expression characterized in a reduction of 
the AT content. 

These objects are solved by the subject matter outlined in the Claims. 

In the following, certain concepts are explained in more detail in order to make clear how they 
should be understood in this context. 

"Recombinant manufacturing process" means that a protein of a DNA sequence is expressed 
by a suitable host organism in which the DNA sequence has arisen from cloning and fusion of 
individual DNA fragments. 

"Complete gp190/MSP1 protein" here means the entire gp190/MSP1 surface protein 
isolatable from the above named Plasmodia, especially Plasmodium falciparum, representing 
the main surface protein of the above named parasite as well as the proteins with analogous 
function from the other Plasmodium species such as P. vivax. The term therefore comprises in 
each case the main surface protein of the merozoites of the four malaria parasites named 
above as dangerous to man. "Complete gp190/MSP1 gene" means the gene coding for this 
protein. In this context "complete" signifies that the entire amino-acid sequence of the native 
protein is present or that the gene sequence codes for the entire amino-acid sequence of the 
native protein. Mutated and/or shortened forms of gp190/MSP1 are however included 
therewith insofar as they display the same immunization potential (vaccine protection) as the 
complete gp190/MSP1. Finally the term also includes variants of gp190/MSP1 characterized 
by containing in one molecule protein fragments of various alleles. 

"FCB-1" is a strain of P. falciparum such as that described in Heidrich, H.-G., Miettinen- 
Baumann, A., Eckerskorn, C. and Lottspeich, F. (1989) The N-terminal amino acid sequences 
of the Plasmodium falciparum (FCB1) merozoite surface antigens of 42 and 36 kilodalton, 
both derived from the 185-195-kilodalton precursor. Mol. Biochem. Parasitol. 34, 147-154. 
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"Attachment signal" here means a protein structure coded for by a DNA sequence at the 3' or 
5' end of the gene according to the invention. Attachment signals are structures enabling the 
attachment of a polypeptide to other structures, such as for example membranes. 

"Signal peptide" here signifies a protein structure for which a DNA sequence at the N-terminal 
end of the gene according to the invention codes. Signal peptides are structures which among 
other things enable penetration of the polypeptide into membranes. 

In the context of the present invention "AT-content" means the percentage amount of 
adenine-thymine base pairs compared to guanine-cytosine base pairs. 

"Cloning" will comprehend here all known state-of-the-art cloning methods which could be 
applied here, which are nevertheless not all described in detail because they belong among 
the normal tools of the person skilled in the art. 

"Expression in an appropriate expression system" should here include all known state-of-the- 
art methods of expression in known expression systems which could be applied here, but 
which are nevertheless not all described in detail because they belong among the normal 
tools of the person skilled in the art. 

It is a primary object in regard to the present invention that a process be provided by which the 
protein pg190/MSP1 and its gene can be produced in sufficient quantity without excessive 
cost. 

This object is solved by the recombinant manufacturing process set out in Claim 1 , by which a 
complete gp190/MSP1 gene and the protein coded by it are obtainable in sufficient quantities. 

For the first time it is possible by this process to synthesize the protein in its entirety outside 
the parasite. As the analysis with conformational epitope-recognizing monoclonal antibodies 
shows, the protein thus synthesized is at least reproducibly synthesizable over wide areas in 
naturally folded form. By the recombinant manufacturing process many milligrams of intact 
gp190/MSP1 could in every case be recovered from the host organism, a quantity which for 
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technical and economic reasons can never be recovered from parasites. Production of the 
protein in any desired quantity is now possible and opens new perspectives for its use as an 
experimental vaccine against malaria. Furthermore, the way is now open for the development 
of living vaccines as well as for vaccines based on nucleic acids. 

Synthesis of the gene sequence coding for the protein gp190/MSP1 is preferentially based on 
the sequence of the FCB-1 strain of P. falciparum. P. falciparum is the agent of tropical 
malaria and hence of the most dangerous among the types of malaria. The basic gene is a 
representative of the "K1 allele", where K1 stands for a particular P. falciparum strain. Its 
coding sequence extends over 4917 base pairs and includes a signal sequence at the N- 
terminal end as well as an attachment sequence at the C-terminal end. 

Q Furthermore, according to the invention the recombinant manufacturing process is 

fi preferentially characterized in having the AT content of the DNA sequence on which the 

0=i protein is based reduced relative to the wild type, from 74% in the original gene preferably to 

g about 55%, for example while the amino-acid sequence of the FCB-1 protein is maintained a 

H DNA sequence with the codon frequencies usual in the human genome is produced. Other 

J codon frequencies which reduce the AT content are also conceivable. 

O Preferentially the gene underlying the protein produced by the recombinant manufacturing 
=J process codes for the full amino-acid sequence including signal peptide and GPI attachment 
& signal peptide, further described as gp1 90 s . 

In another preferred embodiment, the gene on which the protein produced by the 
recombinant manufacturing process is based codes for the complete amino-acid sequence 
except for the GPI attachment signal. This embodiment is then described as gp190 s1 . 

In yet another preferred embodiment, the gene on which the protein produced by the 
recombinant manufacturing process is based codes for the complete amino-acid sequence 
except for the GPI attachment signal and the signal peptide. This embodiment is then 
described as gp190 S2 . 
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In a further preferred embodiment type, the gene on which the protein produced by the 
recombinant manufacturing process is based codes for the complete amino-acid sequence 
and a trans-membrane attachment sequence. 

In a particularly preferred embodiment the recombinant manufacturing process includes the 
following steps: 

In the first place the design of the DNA sequence to be synthesized on the basis of the gene 
from P. falciparum FCB-1, in which a DNA sequence with for example the codon frequencies 
common in the human genome is manufactured with retention of the amino-acid sequence of 
the FCB-1 protein. 

The AT content of the gene should be reduced by this, preferably to 55%. Further on in the 
process the planned sequence is divided for example into 5 overlapping regions, which at the 
same time correspond to domains of the natural processing products of gp190/MSP1 from 
FCB-1: p83, p31, p36, p30 and p19. 

Desoxyoligonucleotides are synthesized, which in each case extend the entire length of a 
region. 

The desoxyoligonucleotides so synthesized are particularly preferred where their sequence 
corresponds in an alternating manner to the "upper" (5' - 3') or the "lower" (3' - 5') DNA 
strand. The length of these oligonucleotides is preferably on average 120 nucleotides and 
they overlap the neighboring sequences in each case by about 20 bases. 

In one possible embodiment DNA sequences of about double the length of the existing end- 
products are manufactured by asymmetrical PCR, in effect so that the superfluous DNA 
sequences nearby in each case representee opposite strand. This leads in a second 

PCR amplification cycle to a second product corresponding to the length of four originally 
inserted oligonucleotides excluding the overlapping region. Transfer of these products to a 
preparation consisting predominantly of single-stranded DNA by asymmetrical PCR with the 
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terminal oligonucleotides permits the manufacture in a further amplification step of an 800-bp 
long double-stranded DNA fragment in only 25 PCR cycles. 

In this manner the regions coding for p19, p30, p36 and p31 are directly synthesized and 
molecularly cloned in E. coli. Clones with fault-free sequences are conserved either directly or 
by the joining up of fault-free sequence fragments. The region which codes for p83 is 
constructed by fusion from two sequences comprising about 1200 bp. 

In the further course of production single sequences are cloned. As expression vectors 
candidates preferred are the plasmids pDS56, RBS11 ("Hochuli, E., Bannwarth, W., Doebeli, 
H., Gentz, R. and Stueber, D. (1988) Genetic approach to facilitate purification of recombinant 
proteins with a novel metal chelate adsorbent. Biotechn. 6, 1321-1325"), pBi-5 ("Baron, U., 
Freundlib, S., Gossen, M. and Bujard, H. (1995) Corregulation of two gene activities by 
tetracycline via a bidirectional promoter. Nucl. Acids Res. 23, 3605-3606") and ppTMCS. It is 
possible nonetheless also to conceive of other expression vectors. 

Host organisms preferred for expression are E.coli, with the strain DH5alphaZ1 especially 
preferred (R. Rutz, Dissertation 1996, Heidelberg University), HeLa cells, CHO cells, 
Toxoplasma gondii (Pfefferkorn, E.R. and Pfefferkorn, C.C. 1976, Toxoplasma gondii: Isolation 
and preliminary characterization of temperature-sensitive mutants. Exp. Parasitol. 39, 365- 
376) or Leishmania. Additional host systems might be e.g. yeasts, baculoviruses or 
adenoviruses, so that the subject matter of the invention should not be limited to the host 
systems mentioned. 

A further object of the present invention has been to provide a complete DNA sequence, 
suitable for expression, of the gp190/MSP1 surface protein of P. falciparum. 

This object is solved by the invention named in Claim 17, by which the sequence can be 
obtained by the recombinant manufacturing process described above. 

In a preferred embodiment of the present invention the sequence suitable for expression 
codes for the complete amino-acid sequence. 
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In another preferred embodiment of the present invention the sequence suitable for 
expression codes for the complete amino-acid sequence except for the attachment signal. 

In a further preferred embodiment according to the present invention the DNA sequence 
suitable for expression codes for the complete amino-acid sequence except the attachment 
signal and the peptide signal. This embodiment of gp190/MSP1 can hence be characterized in 
including at the N-terminus 1 1 additional amino-acids, of which 6 are histidines. 

Particularly preferred the DNA sequence suitable for expression contains no recognizable 
"splice-donor" and "splice-acceptor" sites, and is preferably characterized in not containing 
any larger GC-rich sequences which might result in stable hairpin structures at the RNA level. 

Recognition signals for restriction enzymes which recognize sequences of six or more base 
pairs should preferably be avoided. 

In a preferred embodiment specific cleavage sites for restriction endonucleases, occurring 
only once in the gene, are introduced into regions to separate the existing domains following 
processing of the protein. 

Particularly preferred would be the presence at both ends of the gene of sequences for 
restriction endonucleases which do not occur in the gene. 

Furthermore host organisms containing the complete sequence of gp190/MSP1 surface 
protein are provided by the invention. 

Such host organisms are preferably E. coli, particularly preferred being the strain 
DH5alphaZ1, HeLa cells, CHO cells, Toxoplasma gondii or Leishmania. The HeLa and CHO 
cells ought preferably to synthesize constitutively tTA. 
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Finally the present invention provides a possibility of using a gp190/MSP1 surface protein 
created produced according to the recombinant manufacturing process, or parts thereof, for 
active immunization against malaria. 

The scheme for synthesis presented here also permits manufacture of the second allele of the 
gp190/MSP1 gene, whereby the dimorphism of the protein is also taken into account. The 
main variability of the protein depends however on the sequences of two relatively short blocs, 
blocks II and IV (ref. 1), which are oligomorphic. The present sequence data make it possible 
to disclose over 95% of all known gp190/MSP1 sequences with 6-8 sequence combinations of 
these blocs. The synthesis of these sequence variants can be brought about problem-free by 
means of the strategies proposed here, so that variants can be built up both in the K1 and in 
the MAD20 allele. Vaccines from the families of sequences thus created can confer protection 
where required against a wide spectrum of parasites with gp190/MSP1 variants. 

The manufacture of different types of vaccine is possible: 

- At the level of protein preparations, where in each instance mixtures of the two families (K1 
type, MAD20 type with different variants of Blocs II and IV) can come into use. Various carrier 
or adjuvant materials could be added: aluminum oxide, liposomes, IscomsQSzl , etc. 

- At the level of live vaccines: (a) viral carriers, especially vaccinia and adenoviruses; (b) 
parasites as carriers, particularly avirulent forms of Leishmania and Toxoplasma; (c) bacterial 
carriers, e.g. Salmonella. 

- At the level of nucleic acids, whereby for example vectors suitable for gene therapy would 
be used to introduce the gene into the host; beyond that the introduction of nucleic acids 
coding for the desired protein can be envisaged. 

A further possibility for vaccination lies in the use of a gp1 90/MSP1 protein produced 
according to the recombinant manufacturing process set out by the invention, for the 
production of monoclonal antibodies which can then be used in their turn for passive 
immunization against malaria. 
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Similarly it becomes possible to use the DNA sequence on which the protein is based at an 
intermediate stage arising in the course of the recombinant manufacturing process for the 
construction of a vaccine based on nucleic acids. 

Finally the invention also concerns a process for the stabilization of gene sequences, 
especially for sequences which do not show adequate stability in expression systems. 

According to the invention this stabilization is attained because the AT content of the 
sequence is reduced. 

Moreover a stabilized gene characterized by having a reduced AT content is provided by the 
invention. An example of such a stabilized gene is the gene for gp190/MSP1 surface protein 
according to the present invention. 

In the following the invention will be described with the help of figures and tables as well as 
some examples in individual embodiments. 

They show: 

Fig. 1: Schematic representation of the gp190/MSP1 precursor 
protein from P. falciparum (FCB-1). 

Fig. 2: Two vaccine trials carried out on Aotus monkeys with 
native gp190/MSP1 from P. falciparum (FCB-1). 

Fig. 2A: With 3 x 60 micrograms gp190/MSP1 

Fig. 2B: With 3 x 40 micrograms gp190/MSP1 

Fig. 3A: Strategy of synthesis of the gp190/MSP1 gene 
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Fig. 3B; Principle of PCR-based total synthesis 

Fig. 3C: Total sequencing of gp190 s 

Fig, 3D: N- and C-termini of gp190 s1 variant 

Fig. 4A: Expression vector pDS56 with gp190 S2 sequence 

Fig. 4B: Gel electrophoresis of gp190 S2 

Fig. 5A: Expression vector pBi-5 with gp190 81 sequence 

Fig. 5B: Immunofluorescence of HeLa cells 

Fig. 5C: Electrophoretic characterization of gp190 s1 
purified from HeLa cells 

Fig. 6A: Expression vector ppT 190 with gp190 sequence 

Fig. 6B: Immunofluorescence of the expression of gp190 s 
in T. gondii 

Fig. 6C: Polyacrylamide gel electrophoresis of gp190 from 
T. gondii 

In the gp190/MSP1 precursor protein from P. falciparum schematically represented in Fig. 1 
the dark blocs stand for regions which are strongly conserved in all strains. The cross-hatched 
blocs indicate the dimorphic areas, which in the case of the FCB-1 isolate derive from the K1 
allele. 01 and 02 indicate the oligomorphic areas. S denotes the peptide signal sequence 
containing 19 amino-acids, GA the C-terminal region, which includes the signal for the GPI 
attachment of the protein to the membrane. The arrows indicate the sites of the processing by 
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which the proteins p53, p31, p36, p30 and p19 arise. The gp190 gene codes for altogether 
1639 amino-acids. 

The other figures are more conveniently explained in the context of the following Examples. 
EXAMPLES 

Example 1: total synthesis of one of the DNA sequences coding for gp190/MSP1 (see Fig. 3) 

A. Strategy of synthesis of the qp190/MSP1 gene (gp190 s ) (see Fig. 3A). 

The sequence was divided into fragments corresponding to the main processing products: 
p83, p31, p36, p30 and p19. In the transition regions cleavage sites for restriction 
endonucleases (arrows in fig. 3) were inserted in such a way that the amino-acid sequence 
was not altered. All the particular cleavage sites are found only once in the sequence. 

The fragments were synthesized to overlap, so that the cleavage sites at the respective ends 
made attachment by ligation to the neighboring fragment possible. All individual fragments 
contain in addition at their 5' ends a BamHI division site for insertion into expression vectors. 
The entire sequence could be cloned via MIul and Cial. The scheme indicated here leads in 
addition to a sequence which cannot produce the GPI attachment since the C-terminal lacks 
18 amino-acids. Synthesis of a corresponding oligonucleotide as well as of a "primer" 
extending over the Sphl cleavage site, leads after PCR to the GA fragment which could be 
used by Sphl and Clal, the resulting total sequence being gp190 s On removing the sequence 
coding for the peptide signal, "PCR Primer" is produced, over which the fragment AS has 
been synthesized. It is permissible to alter the N-terminal via a BamHI and a Hindlll cleavage 
site in such a way that the protein begins with amino-acid no. 20. The nuclear sequence which 
encodes gp190/MSP1 without signal sequence and without GPI attachment signal was 
designated gp190S2. Deletion of the GPI attachment signal alone leads to gp190 s1 . 
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B. Principle of the PCR-supported total synthesis (see Fig. 3B) 

Oligodesoxynucleotides of about 120 nucleotides have been synthesized in an alternating 
manner from the coding or the non-coding strand in such a way that in each case about 20 
bases overlapped with the neighboring fragment. The scheme illustrates for example the 
synthesis of a fragment about 800bp long from oligonucleotides. At the first stage 2 
oligonucleotides were amplified "asymmetrically" in each of 4 reaction vessels. This resulted in 
4 populations of DNA about 220bp in length, consisting predominantly of single strands (A, B, 

C, D). Uniting A to B and C to D with amplification over 5 cycles led to 2 approximately 400bp 
long double-stranded products. Asymmetrical amplification of these DNA fragments (Stage III) 
resulted in single-stranded populations which following uniting and amplification (Stage IV) 
resulted after 10 cycles in the end-product G of about 800bp in length. This synthesis could be 
carried out without isolation of intermediate products and without renewing buffer or enzyme, 
and was completed in 3 hours. The end-product was purified eiectrophoretically, divided up 
with the appropriate restriction endonucleases, and cloned in E. coli in pBluescript 
(Stratagene), to which polylinker a Mlul and a Clal cleavage site had been added. 

C. Total sequence of gp190 5 (see Fig. 3C) 

Following fusion of all part sequences (Fig. 3A) in pBluescript, the sequence of the gene was 
checked by the di-deoxy method. The reading frame of gp190 s had a length of 491 7bp (+ 2 
stop-codons) and encoded an amino-acid sequence corresponding to that of the gp190/MSP1 
from FCB-1 (1639 amino-acids). 

D. N- and C-termini of the gp190 s1 variant (see Fig. 3D) 

The N-terminal sequence, beginning with the BamHI cleavage site, indicates the transition at 
amino-acid 20, from which it can be assumed that after splitting of the signal peptide it defines 
the N-terminus. At the C-terminus the sequence encoded ended at amino-acid 1621. The 
stop-codon followed the Clal cleavage site. 
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Example 2: Expression of op190 S2 in E. coli 

A. Expression vector (see Fig. 4A^ 

The gp190 S2 sequence was inserted via the BamHI and Cial cleavage sites into pDS56RBSII, 
by means of which 6 histidines as well as some amino-acids originating in the vector were 
fused to the N-terminus. This produces the following N-terminal sequence on the reading- 
frame: Met Arg Gly Ser (His) 6 Gly Ser. Through the promoter P N2 5iaco-i the transcription comes 
under lacR/O/IPTG control. 

B. Expression and purification of qp190 S2 (see Fig. 4D^ 

Carrying over the vector pDS56RBSIIgp1 90 s2 into E. coli DH5alphaZ1 and induction of 
synthesis through IPTG resulted after electrophoretic separation of the total protein extract 
from the culture in a clearly visible band of the anticipated size (arrow). Purification of the 
material through IMAC and affinity chromatography (antibody column with mAK5.2) led to a 
homogeneous product of about 190 kD. In the Figure M stands for molecular weight 
standards; 1 = E. coli before; 2 = after induction with IPTG for 2 hours; 3, 4, 5 = fractions from 
elution of the mAK column. 

Example. 3: Tetracvcline- controlled expression of gp190 s1 in HeLa and CHO cells and 
isolation of the product (see also Fig. 5 and 6c) 

A. The gp190 sequence was inserted via the BamHI/Clal cleavage sites into the expression 
vector pBi-5. In this way transcription of the gene came under the control of a bidirectional 
"tTA-reponsive" promoter and could be regulated through Tc. The bidirectional promoter 
simultaneously initiated transcription of the indicator gene luciferase. In consequence the 
regulation of the expression could easily be followed (see also Fig. 5A). 
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B. Immunofluorescence of HeLa cells, which express luciferase and gp190 under Tc control 

The production of luciferase (left), gp190 s1 (middle) in the absence of Tc was demonstrated in 
HtTA93-9 cells, which contain the bidirectional transcription unit of (A). Following addition of Tc 
no noteworthy synthesis of gp190S1 was shown (as represented in Fig. 5B, right). 

C. Electrophoretic characterization of gp190 s1 purified from HeLa cells 

The HeLa cell clone HtTA93-9 as well as the CHO cell clone CH027-29 have been cultivated 
with or without Tc. Cell extracts separated by electrophoresis have been analyzed with 
mAK5.2 by means of "Western blotting" (Fig. 5C); analysis of the CHO cell line is shown on 
the left, of HeLa on the right (1) = culture without, (2) = culture with Tc, (3) = non-transfected 
HtTA-1 cell line. Molecular weight standards are in each case indicated on the left. 

D. Purification of gp190 s1 synthesised by HeLa cell clone HtTA93-9 

Preparative cultivation of the HtTA line and induction of expression of gp190 s1 by withholding 
Tc permitted isolation of the gene product by affinity chromatography (mAK5.2 column). 

The polyacrylamide gel stained with Coomassie (Fig. 6C) following electrophoresis displayed a 
product consisting of gp190 Si as well as another protein of about 50 kD. The latter was not 
derived from gp190 s1 and thus originated from the HeLa cells. Its projected removal should 
nevertheless present no difficulty in principle. 

Example 4: Expression of gp190 s1 in Toxoplasma gondii and purification of the product (see 
also Fig. 6). 

A. The gp190 s sequence was inserted into the vector ppT via Mlul/Pstl. This brought the gene 
under the control of the tubulin promoter (P tU b-i) of T. gondii. The 3' untranslated region (VTR) 
originated from the main surface protein of T. gondii (SAG-1). 
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B. Expression of qp190 s in T. gondii 

Transfection of T. gondii with pTT190 led to the isolation of parasite lines which expressed 
constitutively gp190 s Immunofluorescence with mAK5.2 (middle picture) showed not only 
expression of the gene but also situated the binding of the expression product close to the 
surface of the parasite, since it, like SAG-1 , provokes the same pattern of 
immunofluorescence (right section of fig. 6B). On the left in Fig. 6B a phase contrast 
photograph of the middle picture is shown. 

C. Isolation of gp190S from T. gondii. 

By means of affinity chromatography (mAK5.2 column) gp190 s was purified from a prepared 
quantity of T. gondii (5 x 10 9 parasites). The extremely pure protein possessed the anticipated 
molecular weight, as the Coomassie-stained polyacrylamide gel indicated following 
electrophoresis (2-3 on Figure 6C). At no. (1) on Fig. 6C purified gp190 s1 from CHO cells is 
represented with molecular weight marked on the left side. 

Example 5: Characterization of op190 s with monoclonal antibodies. 

The interaction of 16 monoclonal antibodies with gp1 90 s from the various heterologous 
expression systems was reviewed by immunofluorescence on P. falciparum and T. gondii or 
by "Western blot" on the purified proteins. Complete agreement was found when the two 
parasites were compared (number of +s indicates the relative intensity of the fluorescence). 
On Western blotting 12mAK's reacted with gp190 s from E. coli and T. gondii. On the other 
hand 3 antibodies did not bind to material isolated from CHO cells. Antibodies 15 and 16, 
which recognize epitopes from the oligomorphs or the alternative allele (MAD20), did not 
react. The results are summarized in Table 1, in which ND means "not carried, out". 
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Example. 6: expression of qp190 s in heterologous systems 

1 . Expression in E. coli 

The gp190 S2 was inserted into the expression vector pDS56, RBSII, where it came under 
control of the promoter PN 2 siaco-i, which can be controlled via the lac operator/repressor/IPTG 
system (Fig. 4A). Transfer of the plasmid into repressor-producing E. coli cells, eg E. coli 
DH5alphaZ1, permitted expression of pg190 S2 under IPTG control. By means of a nickel- 
chelate column the raw product could be isolated via the N-terminal (His) 6 sequence 
introduced by the vector. An ensuing affinity chromatography on an antibody column led to an 
extremely pure preparation. Since the monoclonal antibodies used (mAK5.2) recognized a 
conformational epitope in the C-terminal region, this 2-step purification selected a full-length 
intact protein with correct folding at least at the C-terminus (Fig. 4B). 

In contradistinction to the natural material the end-product possesses 1 1 additional amino- 
acids at the N-terminus, of which 6 are histidines. It contains no N-terminal signal and also no 
C-terminal attachment sequence. The P. falciparum-specific sequence begins with amino-acid 
20 and ends with amino-acid 1621 . 

2. Controlled expression of qp190 s1 in HeLa and CHO cell cultures 

The gp190S1 was inserted into the vector pBi-5 and thereby placed under control of a 
promoter regulable by tetracycline (Tc). The Tc-contolled system was chosen for 2 reasons: 

- It belongs to the expression systems with which the highest yield is obtained in mammalian 
cells. 

- Unsecreted foreign proteins at high concentration can interfere negatively with cell 
metabolism. Synthesis of the desired product is consequently begun only after maturation of 
the culture. 
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In the construct pBi5-gp190 S1 a bidirectional promoter was activated by the Tc-controlled 
transcription activator and initiated transcription of both gp190 s1 and the luciferase indicator 
gene. In the presence of Tc the promoter is inactive. The transcription unit was transferred into 
both HeLa and CHO cells, which both synthesize constitutiveiy tTA (HtTA line: Gossen, M. and 
Bujard, H. (1992), Tight control of gene expression in mammalian cells by tetracycline- 
responsive promoters. Proc. Natl. Acad. Sci. USA 89, 5547-5551; CHO-tTA line, 
unpublished). Through cotransfection (Ca 2+ -phosphate method) with a hygromycin- 
resistance-inducing marker gene was selected for successful chromosomal integration. 
Hygromycin-resistant clones were then investigated for regularity of the expression >Tc, in 
which luciferase activity was used as indicator. The gp190 synthesis was tested in well 
regulable clones (regulation factor »Tc 1000). Immunofluorescence analysis (Fig. 5B) as well 
as investigation by "Western blot" (Fig. 5C) allowed the identification in both cell types of 
clones which synthesized gp190 under strictly regulable conditions. The best regulable of 20 
clones were in each case subcloned. The subclones HtTA93-9 and CH027-29 were used for 
cultures on a scale of 10:1. From cell extracts of these cultures intact gp190 s1 could be 
isolated by means of affinity chromatography (mAK5.2). The material was homogeneous 
except for a single cellular component which did not derive from gp1 90 S1 and made up 25% of 
the preparation (Fig. 6C). it had to be removed in a further purification step. 

3. Expression of gp190S in Toxoplasma gondii. 

Like P. falciparum, Toxoplasma gondii belongs to the Apicomplexa and consequently has a 
protein modification system apparently similar to that of P. falciparum. T. gondii can be 
transfected with foreign DNA which is efficiently integrated into the genome and furthermore 
allows problem-free multiplication of T. gondii in cell culture. To obtain a product most like 
native gp190, gp190S2 is expressed in such a way that the protein is secreted on the surface 
of the parasite and, as in P. falciparum, bound to the membrane via a GPI analogue. In that 
way the gp190S2 (Fig. 3A) has been inserted (Fig. 6A) into the plasmid ppTMCS (D. Soldati, 
unpublished) and thereby placed under the control of the T. gondii tubulin promoter, 

This expression construct was transfected into T. gondii. Selection with chloramphenicol led to 
resistant clones synthesizing gp190 which were detected by immunofluorescence (Fig. 6B). 
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The immunofluorescence with anti-gp190 antibodies was indistinguishable from a 
corresponding pigmentation of the parasites by means of antibodies against SAG1, the main 
surface protein of T. gondii. It may be deduced from this that gp190 is bound to the surface of 
T. gondii. Several T. gondii clones (Nos. 3.1 to 3.4) were characterized and saved for the 
production of gp190. Using affinity chromatography (mAK5.2) gp190 was isolated from T. 
gondii cultures (clone 3.4) cultivated on a preparative scale. Electrophoretic analysis revealed 
a homogeneous product with a migration rate similar to that of the intact protein (Fig. 6C) 

Example 7: Characterization of gp190 protein from various expression systems by means of 
monoclonal antibodies. 

A set of gp1 90-specific monoclonal antibodies, of which a number recognize conformational 
epitopes, were used to compare the reactivity of the antibodies with P. falciparum and T. 
gondii parasites via immunofluorescence. Table 1 shows that the reactivity of the 16 
antibodies with either parasite is the same. This is a strong indication that in T. gondii "native" 
gp190 is being mostly produced. Comparison of the reactivity of the antibodies with protein 
from E.coli, HeLa or CHO cells as well as T gondii shows also that most of the antibodies react 
with the 4 preparations. In particular the protein derived from E. coli recognizes more of the 
antibodies than that produced in mammalian ceils. This is apparently a consequence of 
glycosylation in mammalian cells. 

Example 8: Immunization of Aotus lemurinus qriseimembra monkeys with gp190/MSP from P. 
falciparum (FCB-1). 

Two independent immunization experiments (A, B) were carried out. In them in one instance 
(A) 1 .0 mg and in the other (B) 0.6 mg of very pure gp1 90/MSP1 was extracted from about 2 x 
10 11 parasites respectively. 

The protein was administered together with Freund's Adjuvant (FCA). The control group 
received only FCA. Immunization equally with the protein mixture or the adjuvant was done 
three times at intervals of 4 weeks. Two weeks after the last immunization each of the animals 
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was infected with 10 5 parasites (FVO strain) from a donor animal. Parasitaemia was measured 
daily. The results are summarized in Fig. 2. The symbols mean: 

T: that the animals were treated with resochin 

D: a dead animal 

Fig. 2A: individuals in the vaccinated group each received 
3 x 60 micrograms gp190/MSP1 

Fig. 2B: individuals in the vaccinated group each received 
3 x40 micrograms gp190/MSP1 

While in the control group only 1/11 animals did not develop parasitaemia, this was 6/10 in the 
vaccinated group. The four animals in the vaccinated group who did develop a pronounced 
parasitaemia did so - in comparison to the control group - with an average delay of four days 
(exceeding the 2% limit of parasitaemia). 

These experiments indicate for the first time a highly significant protection by gp1 90/MSP1 
against infection with P. falciparum in a monkey model. The process according to the invention 
consequently permits a practical vaccine against malaria to be presented for the first time. 
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Patent Claims 

1 . Process for preparing of the complete gp1 90/MSP1 protein of Plasmodium, in 
particular Plasmodium falciparum characterized in that the complete gene for 
gp190/MSP1 is expressed in a suitable system, preferably a host organism. 

2. Recombinant manufacturing process according to Claim 1 , characterized in that the 
synthesis of the DNA sequence on which the protein is based derives from the DNA 
sequence of the P. falciparum strain FCB-1. 

3. Recombinant process according to Claim 2, characterized in that the AT content of the 
DNA sequence expressed is reduced compared to that of the naturally occurring 
sequence, preferably from 74% to 55%. 

4. Recombinant process according to one or more of claims 1 to 3, characterized in that 
the gene on which the protein produced is based codes for the complete amino-acid 
sequence including signal peptide and attachment signal. 

5. Recombinant process according to one or more of claims 1 to 3, characterized in that 
the gene on which the protein produced is based codes for the complete amino-acid 
sequence except for the attachment signal. 

6. Recombinant process according to one or more of claims 1 to 3, characterized in that 
the gene on which the protein produced is based codes for the complete amino-acid 
sequence except for the attachment signal and the signal peptide. 

7. Recombinant process according to one or more of claims 1 to 6, characterized in that it 
includes the following steps: 

(a) Design of the DNA sequence from P. falciparum FCB-1 to be synthesized, by 
which a DNA sequence with the codon frequencies common in the human 
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genome and maintaining the aminoacid sequence of the FCB-1 protein would 
be produced, 

(b) division of the planned sequence into overlapping regions, preferably regions 
p83, p31, p36, gp30 and gp19, 

(c) synthesis of desoxyoligonucleotides, each of them extending the whole length 
of a region, 

(d) synthesis of the regions coding for gp19, gp30, p36 and p31 by PCR and 
synthesis of the region coding for p83 by fusion of two sequences comprising 
approximately 1200 bp, 

(e) individual cloning of coding sequences, 

(f) fusion of the complete gene and 

(g) expression in a suitable expression system. 

8. Recombinant process according to Claim 7, characterized in that the 
desoxyoligonucleotides synthesized in step (c) should be on average 120 nucleotides 
long and in each instance overlap the neighbouring sequences by about 20 bases. 

9. Recombinant process according to one or more of Claims 1 to 8, characterized in that 
dPS56, RBSII is used as expression vector. 

10. Recombinant process according to one or more of Claims 1 to 8, characterized in that 
pBi-5 is used as expression vector. 

1 1 . Recombinant process according to one or more of Claims 1 to 8, characterized in that 
ppTMCS is used as expression vector. 
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12. Recombinant process according to one or more of Claims 1 to 1 1 , characterized in that 
the DNA sequence on which the protein is based is expressed in E. coli. 

13. Recombinant process following Claim 12, characterized in that the E.coli strain used is 
the repressor-producing strain E. coli DH5alphaZ1. 

14. Recombinant process according to one or more of Claims 1 to 1 1 , characterized in that 
the DNA sequence on which the protein is based is expressed in HeLa cells. 

1 5. Recombinant process according to one or more of Claims 1 to 1 1 , characterized in that 
the DNA sequence on which the protein is based is expressed in CHO ceils. 

16. Recombinant process according to one or more of Claims 1 to 11 , characterized in that 
the DNA sequence on which the protein is based is expressed in Toxoplasma gondii or 
Leishmania. 

1 7. Complete DNA sequence, suitable for expression, of the gp1 90/MSP1 surface protein 
of Plasmodium, in particular P. falciparum, preferably obtainable through the 
recombinant process according to one or more of Claims 1 to 1 6. 

18. DNA sequence, suitable for expression, according to Claim 17, characterized in that it 
does not code for the attachment signal. 

1 9. DNA sequence, suitable for expression, according to Claim 1 8, characterized in that it 
codes neither for the attachment signal nor the signal peptide. 

20. DNA sequence, suitable for expression, according to Claim 19, characterized in that it 
includes a sequence for 1 1 additional amino-acids, of which 6 are histidines, present at 
the N-terminus. 
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21 . DNA sequence, suitable for expression, according to one or more of Claims 17 to 20, 
characterized in that the sequence includes no recognizable "splice donor" and "splice 
acceptor" signals. 

22. DNA sequence, suitable for expression, according to one or more of Claims 1 7 to 21 , 
characterized in that the sequence includes no large GC-rich sequences. 

23. DNA sequence, suitable for expression, according to one or more of Claims 17 to 22, 
characterized in that the sequence includes no recognition signals for restriction 
enzymes recognizing sequences of six or more base pairs. 

24. DNA sequence, suitable for expression, according to one or more of Claims 1 7 to 23, 
characterized in that the sequence for recognition signals of particular restriction 
nucleases in regions which, after processing of the protein, separate existing domains, 
contains uniquely occurring cleavage sites for restriction endonucleases. 

25. DNA sequence, suitable for expression, according to one or more of Claims 1 7 to 24, 
characterized in that the sequence has at both its ends cleavage sites for 
endonucleases which do not appear in the remaining sequence and in a vector for use. 

26. Host organism which contains the complete nucleic acid sequence for the gp190/MSP1 
surface protein and/or the complete protein. 

27. Host organism according to Claim 26, characterized in that the organism is E. coli. 

28. Host organism according to Claim 27, characterized in that the E. coli strain is the 
repressor-producing E. coli strain DH5aiphaZ1. 

29. Host organism according to Claim 26, characterized in that the host organism is HeLa 
cells. 
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Host organism according to Claim 26, characterized in that the host organism is CHO 
cells. 

Host organism according to Claim 29 or 30, characterized in that the host cells 
synthesize constitutively tTA. 

Host organism according to Claim 26, characterized in that the hosts provided are 
Toxoplasma gondii, Leishmania, baculoviruses, adenoviruses or yeasts. 

Use of a gp190/MSP1 protein manufactured according to Claims 1 to 16 for active 
immunization against malaria. 

Use of a gp190/MSP1 protein manufactured according to Claims 1 to 16 for the 
manufacture of monoclonal antibodies suitable for passive immunization. 

Use of a DNA sequence manufactured according to Claims 1 to 16 for the manufacture 
of a vaccine based on nucleic acids. 

Process for stabilization of gene sequences, characterized in that the AT content of the 
sequence is reduced. 

Stabilized gene, characterized in that it shows a reduced AT content than the 
unstabilized gene. 

Vector containing a DNA sequence following one of the Claims 17 to 25 and/or ' 

3% 

Host cells containing a vector according to Claim Iff. 

Vaccine containing a protein manufactured according to a process according to any of 
the Claims 1 to 16 and/or a DNA sequence according to one of the Claims 17 to 25 
and/or host according to one of the Claims 26 to 32 and/or a vector according to 
Claim Jr^ 
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ytt?. Vaccine according to Claim J3€C characterized in that it contains further immunity- 
promoting products from Plasmodium, especially P. falciparum. 
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Summary 

Recombinant manufacturing process for a complete malaria antigen gp190/MSP1 

The invention concerns a recombinant manufacturing process for the complete gp190/MSP1 
surface protein of Plasmodium, in particular P. falciparum, as well as the complete DNA 
sequence of this protein and suitable host organisms for expression of the sequence, by 
means of which the protein in its entirety can for the first time be synthesized outside the 
parasite. 

The invention opens up for the first time the possibility of manufacturing the gp190/MSP1 
surface protein in sufficient quantity; in addition there is the matter of making gp190/MSP1 
available as a vaccine. 

Finally the invention sets out a process for the stabilization of AT-rich genes (Fig. 1). 
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from FCG-T Ce ° f " atiVe (9Pl90n) and ° f the s y nthetic 9ene (gp190s) for gp1 90 



AS HKIIFFLCSFLFFIINTQCVTHESYQE 27 

gp!90 n G A TTA T TA A A T A A T ACT A A 

gp 1 9 0 3 CK^gg^ATGAAAATCATTTTCrrCCTCTG 9 Q 

ASLVKKr.EALEDAVLTCySLFQKFKHVI.NSGT S7 
gplSO 11 T C A A A AATGATTTTATA A A AT A A A 

gp 1 9 0 3 CTGGTTAAGAAACTGCAAGCTTTCCAAGAX^ 180 

ASSGTAVTTSTPGSKCSVASGGSGGSVASGGS 87 
gpl90 n A A T TT T AGTAT TCA TA CATTATCA 

gpl90 3 AGTCCCACCGCCCTTACAACCACCACACCCGCTTCTAAAGGGTCT^ 270 

ASVASGGSVASGGSVASGGSGNSRRTNPSDNS 117 
gpl90 n T T A T TCA T T T TTCA T AT TTCAA CTA TATTA 

gpl90 3 GTCSCCTCCGGCGGCAX^GTGGCATCAGGTGGCTCAGTG^ 360 

ASSDSDAKSYADLXHRVRNYLLTIXELKYPQL 147 
gpl90 A T ATTAT TTTAAA AC TCTGTA A A C ATT AC C 

gp!90 3 AGCCATTCCGACGCCAACTCCTACGCCGACCT<^ 4 SO 

ASFDLTNHKLTLCDNIHGFXYLIDGYEEINEL 177 
gp!90 n T TT A T A TT T T TA TAT AT TA 

gp!9 0 S TT C GACCTCACTAATCATATOC TGACACTG TGT CATAACATTCAT GGCTTCAAATATCT CATTGAC GGTTACGAAGA^LATCAAT GAACTC 540 

ASLVKLHrifrDLLKAKLNDVCASDYCQIPPHL 207 
gpl90 n TATAACTTTT AT A A T A TATT T AT CT 

gpi90 3 CTGTACAACTTCAATTTCTACTTCGA^ 630 

ASXIRANELDVLXKLVFGYRXPLDNIXDNVGX 237 
gplSO*" 1 ATCTATAA CTAACTG AAA AT A TTAT A A 

gp!90 3 AAGATCAGACCCAACGAGTTGGACCTZATTG^ 720 

ASHEDYIXKHXXTISNINELISESXKTrDXNX 267 
gp!30 A C A AA AATATATAT ACT G A A ? T 

g?i3 0 3 ATGCLAAGArTATATTAAAAAGAATAA^ 3 10 

ASNATKEEEXXXLYQA-QYDLS I YNXQLEEAHN 297 
gpl90 n T A A A A A A ATATTTTTCT AT A A T 

gp 1 9 0 3 AATCCAACCAACGACGAAGAAAAGAAGAACTT^TAC 900 

ASLISVLEXRIDTLKKNENIXSLLDKINSIKN 327 
gpl90 n T A A TTAAATT TT AAA C T G TATT A A 

gp 1 9 0 3 CTCATCAGCGTACTGGAGAAGCGCATAGACACC CTCAAGAAGAATGAAAATATC^AAGLAACTGCTC 990 

ASPPPANSGNTPNTLLDXNKXIEEH2KEIKEI 357 
gpl90 n CAG T ATAATTCTT A A C A A AA AT 

gp!90 3 CZTC CGC CAGCCAACTCTGGCAACACCCCTAACACGC 1030 

ASAKTIKFKIDSLFTDPLELSYYLREKNKiflD 387 
gplSO 11 T ATTTAGTA A AT A A TA AA A TT 

gpl90 3 GCCAAAACCATTAAGTTCAACATAGATTCTCTCTTTAC 117 0 

A3 ISAKVSTKESTEPNSYPHCVTYPLSYNDIN 417 
gpi90 :i AAGT ACTA TC AATTTTA T 

gp 1 9 0 3 AT CT C CGCCAAAGTCGAGACAAACGAATCAAC C GAAC CTAAT<1AATATCC CAATGG TGTCACCTACC CTCTGTCTT AT AACGAT ATCAAC 1260 

ASNALNELNSFGDLINPFDYTKEFSXNIYTDN 447 
gplSO** 1 T T A T A T TCT T T A TAT A AAG A C A T T T 

gp 1 9 0 3 AACGCTCTCAACGAGCTCAATAGCTTCGGTGACTTGL^ 1350 

ASERXKFINEIKSXIXIEXXKIESDKXSYEDR 477 
gpi90 n A AACAT T A A T A A A A ATC T A TC A A 

gp 1 9 0 3 CLAGAGAAAGAACTTTATCAACGAAATCAAGGA 144 0 

A5SXSLNDITXSYEXLLNEIYDSXFMNNIDLT 507 
gpl90 n TCT GTC TT A A A AT ATTAT AG T TATTAT 

gp 1 9 Q 3 AGCAAAAGTCTAAACGATATCAXTTAAAGAGTATGAAAA 1530 

ASNFEXHHGKRYSYKVEKLTH3NTFASYENSX 537 
gp!90 n TA TAATATT T T A A 

gp 19 0 3 AACTTCCAGAAAATGATG<^GAAAACGGTACTCTTACAA^ 1520 
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ASaNLEKLTXALKYMZDYSLRtfIVV£Kri.KYY SS7 

gp!90 n A T A A A AT TAATAATA T A ? 

gp!90 s CATAATCTTGAGAACCTCACCAAAGCTCTTAAGTATATG m Q 

ASXHLISKlESEISTLVSNIXlCOSEQLrEKKI S97 
gpl90 n ATA CAA T T A AT A AT A A C T AAA 

gp!90 a AAGAATCTCATAAGTAACJ^GAAAACGAGATCGAGACCCT^ 1800 

ASTKDENXPDrKILSVSOIVXVQVQKVLLMNK 527 

q?190 r * T AATTAAATC A A T A A TT AT A A 

gp!90 3 ACAAAAGACCAAAATAAACCAGATGAGAAGATCCTGGAGGTCTC^ 1590 

ASIDELXXTQLILKHVELKHNSaVPMSYKQElI SS7 
gpl90 n C T A A TCTAATAA TCTCCAAA 

gp 1 9 0 3 ATTGATGAACTCAAGAAGACTCAACTCATT^ 1980 

ASXQSPYYLIVLKKEIDXLXVFMPKVESLINS 637 
gp!9C a A TTTTATGTGA ATT T A T G A ATCAT A T 

gp!90 3 AAGCAGGAACCATACTACCTCATCGTJICTCAAGAAAG^ 2 070 

ASEKKNIKTEGQSDNSSPSTEGEITGQATTXP 717 
gpl90 n AAA A A TAG TGAAAC A A AATAAT 

gpl90 5 CAGAAGAAGAACAXTAAAACTGAAGGACACT^ 21 SO 

ASGQQAG SALEGDSVQAQAQEQXQAQPPVPVP 747 

gp!90 n A A A T T A A TCA A A A A A A A A A A A 

gpl90 3 GGACAACAGGCCGGTTCAOTTCTCGAAG 2250 

ASV?EAXAQVPT?'?APVNNXTENVSKLDYI,SX 777 

gpl90 n A AA A CA AAA TATA TTC T A T T A A 

gpl90 3 (TTTCCAGAGGCTAAAGCrCAAGTGCCTACACC 2340 

ASLYErLtfTSYICHKYILVSHSTKNSKIiXQY 307 
gp!90 n T A A TT A T A T A T TTGTA TCA A AT A AT 

gp 1 9 0 3 CTCTATCAGTTCCTGAATACATCCTAC^ 2430 

ASKITKEEESXLSSCDPLDLLFNXQNNIPVMY 837 
gpigO 11 ATA G A A C T AAGT A TAT AT T AAT ATA 

gp!90 3 AAGATAACCAAGGAAGAGGAiGAGTAAACTGTCCTCIT^ 2520 

ASSHFDSLKNSI.SQLrHElYEXEKVCNLYKLK 8S7 
gpl90 n T " T A AG T A A AT AT AAA TTTTA TG 

gpl9G 5 TCTATGTTCGATACCCTOlACAATTCT^ 2510 

ASD}fDKIKNI*I*EEAXXVSTSVXTl*SSSSKQ?I, 397 

gpl90 n TT A ATTATA G A A A A A T AAGTTCA A T A 

gplSO 3 GACAACGACAAGAT/TAAGAACCTTGTGGAGGAAGCTAAGAAGGTCT^ 2700 



ASStTPQDKPSVSANDDTSBSTNLNNSLXLrS 927 
gpi90 n AT A GTA ATATTTAATTATTG TAGTT ATA A 

gp 1 9 C fl TCTCTCAC^CCTCAAGACAAGCCCGAAGTGAGCGCTAAC 2790 



AStflLSLGXNXNIYQELIGQXSSElfFYEKILK 9S7 
9?I90 n AT AG TAACATA TAATA AGTAGT A T T A T A 

gpl90 9 AACATCCTGTCTCTCGGCAAGlAATAAGAACATr^ 2880 

ASD5DTrYNESFTK?VXSXADDIJlSLNDSSKR 987 
gpl90 n T T T T T ATCT TATTA T TTATGT A AG 

gp 1 9 0 3 GACAGCGACACATTCTATAACGJ^GCTTCACTAACTTCC^ 297 0 

ASKKLEEDIUKLXXTLQ^SrO^YNXYXLXLER 1017 
gpl90 A AX A ATT AT AAA TT A GT ATCA TTTATTAT T A T A A 

gp 1 9 0 3 AAGAAGCTGGAAGAGCACATCJ^TAAGCTGA 3060 

ASLFDXKKTVGXYXMQIXXLTI.LKEQLESKLN 1047 

gpL90 n TATTA A TTA A ATAACt'taTAAAATA TCA T G T 

gp 1 9 0 3 CTCTTCGACAAGAAGAAC^CAGTCK^IAAGTATAAGAT 3 ISO 

ASSiHNPXHVLQKrsVFrWKKKEAEIAETENT 1077 

gp!90 n TTCAGTTAA TTT T AAA T A A A A T A A 

gp 1 9 0 3 TCACTGAACAATCCGAAAC*CGTACTGCAGAACTTCr^^ 324 0 

ASLSHTXILLKHYXGLVKVYNGESSPLXTiSE 1107 

gp!90 n TAA A A AT AT G TT ATTA T A A AT AAT AAGT A 

gpl 9 0 a CTGGAGAACACCAAGATT^TTTCTCAAACACT^ 3330 

ASESIQTEDNYASLSNEKVLSKLEGXLXDNLN 1137 
gplSO 11 ATCA ? A A A TT TT A A T A AT AAG AT A A AT A TTTAT 

g?19 0 a GAGAGCATCCAGACCGAGGATAACTACCKCACCCrrCGAGAACTTCAAGGTCCTGT^ 3420 
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ASLEKKXLSYLSSGLHHLIAELXEVIXNXNYT 1157 
gpl90 n T A A A AT ATCA T A A T T A TT ATT AT A A AAATATTA 

gp!90 a CTGGAGAAGAAGAAGCTCAGCTACCTCTCTAGC^^ 3510 

ASGNSPSENNTDVtfliALESYICXFLPZGTDVA? 1197 
gpl90 n T TCT T A G T T C T T A AATCA TAA 

gpl90 S GGCAAT AGC CCAAGC GAGAAT AAT ACAGAC CTGAATAAC GCACT GGAATCTTACAAGyAAGTTC CTGC CTGAAi^GAACAGATCTC G-C CACT 3600 

ASVVSESGSDTLEQSQFXKPASTHVGAESNTI 1227 
gplSO 11 T AAG AG A TAA AAG AAA A AATCA 

gpi9Q S CTCGTGTCTGAACTCTGGC^CGAC^^ 3690 

ASTTSQSVDDEVODVIIVPirGSSSEDYDDLG 1257 
gp!90 Xl A A A T T A A A A A ATA ATC A A T T TT A A 

gp i 9 0 * ACCACATCTCAGAACGTCGACGAXCAGGTCGAXGA^ 3 T 3 0 

PSVIQKILSKIENEYEVLYLK 1237 
A ATT TAT TGTTATA 

GTGATTGATAACATTCTGTCCJ^AAATCGAGAACC^ 3370 

ASPLAGVYRSLKXQLENNVHTrHVHVKOILHS 1317 
gpl90 a T A T T AAG TAA AT A A T AT TTT TTA TTCA 

gp!90 a CCTCTGGCAGQCGTCTATAGCTCTCTCAA 3960 

ASRFNXRENFXNVLESDLIP-KDLTSSlfYVVK 1347 
gpi90 n A AC T A T T A ATCA T A A TT A A AAG T T A 

gp!90 a CGCT^rTAATAAGAGJ^AAAATTTC^ 4050 

ASDPY.XFLttXEKR'DXFLSSYNYIXDSIDTDIN 1377 
gpl90 n T TATT AAA CT AAGC TT T TAATG A 

gpl90 3 CACCCATACAAGTTCCTCAArAAAOAGAAGAGGGATA^ 4140 

ASFA!lDVLGY.YKILSEXYXSDLDSIXXriNDX 1407 
gpl90 n T A TTA T A AT ATC TAATTATA A C A 

gp 1 9 0 3 TTCGCTAATGAXGTGCTGGGGTATTACAAGATCCTGAGC 4230 

ASQGEHEKYLPFLNNIETLrKTVNDXlDLFVI 1437 
gp!90 n TA G CT TT ACTTG TATA TTT TTTAT 

gp!90 a CAAGG^GAGAATGAAAAAT^rCTCCCCTTC 4320 

AS HLEAKVLNYTYEKSNV2VK IXSLNYLXT XQ HS7 
gpl90 n TTAAAATAT AT ATCA CA AAATTTA T 

gpl90 fl GACCTGXiAGGCCAAGGTCCTCAAjCTATAC^^ 4 410 

ASDXLADFKXNNNFVGIADLSTDYNENNLLTX 1497 
gp!90 r * AT TA T T TT AAA T T CT AT A 

gpi90 3 GACAAGCTCGCAGATTTCAAjGAAAAATAACA 4S00 

ASFiSTGMVFSHrLXSTVLSNiI.DGHLQGMI.NI 1527 
gpl90 a C TAGT AT TT TTT C TT ATCT TATTAT AT T A T 

gp!90 a TTTCTGTCCACTGGCATGGTGTTCGAAAAC 4S9 0 

ASSQHQCVKXQCP0NSGCFS.HLD2HEECXCLL 1557 
gpl90 n A A A A A T A A TCT A A TAT AA A AT A T AT A 

gpl90 fl TCCCAGCACCAATGCGTCJU^AAACJUGTGCC^ 4 680 

ASNYKQEGDXCVESPNPTCNENNGGCDADAKC 1537 
gpl90 n T T ATTAT T TTC T TA TA C T 

gp 1 9 0 * AACTACAAACAAC^^AG^AGATA^ 4770 

AST-EEUSGSNGKKITCECTXPDSYPLFDGIFC IS17 
5P190'* 1 A TTCA TAGC TA ATTT T TT C 

gpl90 3 ACCGAGG^GACAGCGGCTCTAACGGAAAGAAAATCACAT^ 4 8 SO 

ASSSSNFLGIFFLLrLKilLYSF:** 1S39 
gp!9C n AGTTC C TAAACA T AT A A T A AT A T T 

gp 1 9 0 3 TCCACCTCTAATTTCCTGCkSCATCTTCTTC 4 940 

atop codon CI a 1 
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